PRAM and Disclosure Risk

نویسندگان

  • Peter-Paul de Wolf
  • JM VOORBURG
چکیده

Statistical Disclosure Control is part of the usual process, necessary for the dissemination of microdata. Usually, direct identifiers are removed and variables that can be used to re-identify certain records are recoded or suppressed. For certain surveys these methods are inadequate to produce safe microdatasets with enough detail, to satify both the supplier and the user of these sets. To that end, data perturbation techniques were developed. These techniques are meant to maintain more detail, while at the same time enough disclosure control is achieved. Basically, these methods can be devided into two groups. The first group consists of methods that attempt to ensure anonymity of the respondents during the interview, e.g., randomised response. The second group consists of methods that can be considered as part of the editing process, e.g., resampling, dataswapping and noise addition. PRAM (Post Randomisation Method) is a perturbation technique that has connections to both groups of methods. In one way it can be considered to be similar to randomised response. However, contrary to standard randomised respons techniques, PRAM is applied after the interview has been completed. Moreover, it can be considered to be some sort of missclassification as well, with known missclassification probabilities. Disclosure risk in case of the traditional methods like recoding and suppression, is usually defined in terms of the number of occurrences of certain combinations of categories. Often a simple rule is applied: whenever the frequency of such a combination in the population is above a certain threshold, that combination is considered to be safe. Intuitively this means, that whenever there are enough respondents satisfying that particular combination of categories, the uncertainty when matching such records with another file is still large enough. In case of PRAM, this rule can not be applied. In this paper we will propose a different approach to quantify the disclosure risk when applying PRAM.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Logistic Regression with Variables Subject to Post Randomization Method

An increase in quality and detail of publicly available databases increases the risk of disclosure of sensitive personal information contained in such databases. The goal of Statistical Disclosure Control (SDC) is to develop methodology that aims at minimizing disclosure risk while providing society with as much information as possible needed for valid statistical inference. The Post Randomizat...

متن کامل

An empirical evaluation of PRAM (Discussion paper 04012)

The views expressed in this paper are those of the authors and do not necessarily reflect the policies of Statistics Netherlands Explanation of symbols. = data not available * = provisional figure x = publication prohibited (confidential figure) – = nil or less than half of unit concerned – = (between two figures) inclusive 0 (0,0) = less than half of unit concerned blank = Due to rounding, som...

متن کامل

Preserving Edits When Perturbing Microdata for Statistical Disclosure Control Ntalie Shlomo, Ton De Waal

To protect individuals in microdata from the risk of re-identification, a general perturbative method called PRAM (the Post-Randomization Method) is sometimes used for masking records. This method adds “noise” to categorical variables by changing values of categories for a small number of records according to a prescribed probability matrix and a stochastic process based on the outcome of a ran...

متن کامل

On Invariant Post Randomization for Statistical Disclosure Control

In this paper, we investigate certain operational and inferential aspects of invariant PRAM (post randomization method) as a tool for disclosure limitation of categorical data. Invariant PRAMs preserve unbiasedness of certain estimators, but inflate their variances and distort other attributes. We introduce the concept of strongly invariant PRAM, which does not affect data utility or the proper...

متن کامل

domestic and international regulations and standards for risk disclosure in banks

Reporting by stakeholder groups, especially shareholders, has always been a demand And reporting and disclosure for the banking network is important. In Iran, banks require disclosing and reporting information and financial and economic events, but there are many international rules and standards for this disclosure. In addition, domestic regulations and requirements are also unclear due to the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003